首页> 外文OA文献 >A Graph-Based Semi-Supervised k Nearest-Neighbor Method for Nonlinear Manifold Distributed Data Classification
【2h】

A Graph-Based Semi-Supervised k Nearest-Neighbor Method for Nonlinear Manifold Distributed Data Classification

机译:基于图的半非线性k近似非线性方法   流形分布式数据分类

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。
获取外文期刊封面目录资料

摘要

$k$ Nearest Neighbors ($k$NN) is one of the most widely used supervisedlearning algorithms to classify Gaussian distributed data, but it does notachieve good results when it is applied to nonlinear manifold distributed data,especially when a very limited amount of labeled samples are available. In thispaper, we propose a new graph-based $k$NN algorithm which can effectivelyhandle both Gaussian distributed data and nonlinear manifold distributed data.To achieve this goal, we first propose a constrained Tired Random Walk (TRW) byconstructing an $R$-level nearest-neighbor strengthened tree over the graph,and then compute a TRW matrix for similarity measurement purposes. After this,the nearest neighbors are identified according to the TRW matrix and the classlabel of a query point is determined by the sum of all the TRW weights of itsnearest neighbors. To deal with online situations, we also propose a newalgorithm to handle sequential samples based a local neighborhoodreconstruction. Comparison experiments are conducted on both synthetic datasets and real-world data sets to demonstrate the validity of the proposed new$k$NN algorithm and its improvements to other version of $k$NN algorithms.Given the widespread appearance of manifold structures in real-world problemsand the popularity of the traditional $k$NN algorithm, the proposed manifoldversion $k$NN shows promising potential for classifying manifold-distributeddata.
机译:$ k $最近邻($ k $ NN)是用于对高斯分布数据进行分类的最广泛使用的监督学习算法之一,但是将其应用于非线性流形分布数据时,尤其是在标记数量非常有限的情况下,其效果不佳可提供样品。在本文中,我们提出了一种新的基于图的$ k $ NN算法,该算法可以有效地处理高斯分布数据和非线性流形分布数据。为了实现这一目标,我们首先通过构造$ R $-来提出约束疲倦随机游走(TRW)。在图上建立最近邻强化树,然后计算TRW矩阵以进行相似性测量。此后,根据TRW矩阵识别最近的邻居,并根据其最近邻居的所有TRW权重之和确定查询点的类别标签。为了处理在线情况,我们还提出了一种新算法来处理基于局部邻域重构的顺序样本。在合成数据集和实际数据集上进行了对比实验,以证明所提出的new $ k $ NN算法的有效性及其对其他版本的$ k $ NN算法的改进。鉴于流形结构在实际环境中的广泛出现,由于存在很多问题,并且由于传统的$ k $ NN算法的普及,所提出的歧管版本$ k $ NN显示了对流形分布数据进行分类的潜力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号